Multigrid Q-learning
نویسنده
چکیده
Reinforcement learning scales poorly when reinforcements are delayed. The problem of propagating information from delayed reinforcements to the states and actions that have an e ect the reinforcement is similar to the problem of propagating information in a discretized boundary value problem. Multigrid methods have been shown to decrease the number of updates required to solve boundary value problems. Here we extend Q-Learning by casting it as a multigrid method and show a reduction in updates required to reach a given error level in the Q-function for a simple, 1-d Markov decision task.
منابع مشابه
Multigrid Algorithms for Temporal Difference Reinforcement Learning
We introduce a class of Multigrid based temporal difference algorithms for reinforcement learning with linear function approximation. Multigrid methods are commonly used to accelerate convergence of iterative numerical computation algorithms. The proposed Multigrid-enhanced TD(λ) algorithms allows to accelerate the convergence of the basic TD(λ) algorithm while keeping essentially the same per-...
متن کاملPerformance-influence models of multigrid methods: A case study on triangular grids
Multigrid methods are among the most efficient algorithms for solving discretized partial differential equations. Typically, a multigrid system offers various configuration options to tune performance for different applications and hardware platforms. However, knowing the best-performing configuration in advance is difficult, because measuring all multigrid-system variants is costly. Instead of...
متن کاملMultigrid and Multilevel Methods for Nonconforming Rotated Q Elements
In this paper we systematically study multigrid algorithms and multilevel preconditioners for discretizations of second order elliptic problems using nonconforming rotated Q nite elements We rst derive optimal re sults for the W cycle and variable V cycle multigrid algorithms we prove that theW cycle algorithm with a su ciently large number of smoothing steps con verges in the energy norm at a ...
متن کاملNew Convergence Estimates for Multigrid Algorithms
In this paper, new convergence estimates are proved for both symmetric and nonsymmetric multigrid algorithms applied to symmetric positive definite problems. Our theory relates the convergence of multigrid algorithms to a "regularity and approximation" parameter a 6 (0,1] and the number of relaxations m. We show that for the symmetric and nonsymmetric V cycles, the multigrid iteration converges...
متن کاملA Compact Multigrid Solver for Convection-Diffusion Equations
diffusion equations using a nine-point compact difference scheme. implementation with multigrid, and carry out a Fourier We test the efficiency of the algorithm with various smoothers and smoothing analysis of the Gauss–Seidel operator. In Secintergrid transfer operators. The algorithm displays a grid-indepention 3 we present numerical experiments that demonstrate dent convergence rate and prod...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1994